Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation

Abstract

Self-supervised monocular depth estimation has been widely studied recently. Most of the work has focused on improving performance on benchmark datasets, such as KITTI, but has offered few experiments on generalization performance. In this paper, we investigate backbone networks (e.g., CNNs, Transformers, and CNN-Transformer hybrid models) toward the generalization of monocular depth estimation. We first evaluate state-of-the-art models on diverse public datasets, which have never been seen during network training. Next, we investigate the effects of texture-biased and shape-biased representations using various texture-shifted datasets that we generated. We observe that Transformers exhibit a strong shape bias, whereas CNNs exhibit a strong texture bias. We also find that shape-biased models show better generalization performance for monocular depth estimation compared to texture-biased models. Based on these observations, we newly design a CNN-Transformer hybrid network with a multi-level adaptive feature fusion module, called MonoFormer. The design intuition behind MonoFormer is to increase shape bias by employing Transformers while compensating for the weak locality bias of Transformers by adaptively fusing multi-level representations. Extensive experiments show that the proposed method achieves state-of-the-art performance on various public datasets. Our method also shows the best generalization ability among the competitive methods.
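The abstract does not say how the fusion module computes its output, so the following is only a minimal sketch of one common way to fuse features adaptively: a softmax-weighted sum over levels. The names `softmax` and `fuse_levels`, the per-level scores, and the flat feature vectors are all assumptions made for illustration; this is not MonoFormer's actual module.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_levels(features: Sequence[Sequence[float]],
                scores: Sequence[float]) -> List[float]:
    """Hypothetical fusion: weighted sum of same-length feature vectors.

    features: one vector per level (e.g. shallow to deep), all equal length.
    scores:   one raw score per level; in a real network these would be
              predicted from the input, here they are simply passed in.
    """
    if len(features) != len(scores):
        raise ValueError("need exactly one score per feature level")
    weights = softmax(scores)
    length = len(features[0])
    fused = [0.0] * length
    for weight, level in zip(weights, features):
        if len(level) != length:
            raise ValueError("all feature levels must have the same length")
        for i, value in enumerate(level):
            fused[i] += weight * value
    return fused


if __name__ == "__main__":
    shallow = [1.0, 2.0]  # stands in for a local, CNN-like feature
    deep = [3.0, 4.0]     # stands in for a global, Transformer-like feature
    print(fuse_levels([shallow, deep], scores=[0.0, 0.0]))
```

With equal scores the two levels are averaged. Raising one level's score shifts the fused vector toward that level, which is the sense in which such a fusion is adaptive.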

Similar resources

Self-Supervised Monocular Image Depth Learning and Confidence Estimation

Convolutional Neural Networks (CNNs) need large amounts of data with ground truth annotation, which is a challenging problem that has limited the development and fast deployment of CNNs for many computer vision tasks. We propose a novel framework for depth estimation from monocular images with corresponding confidence in a self-supervised manner. A fully differential patch-based cost function is...

A Compromise Principle in Deep Monocular Depth Estimation

Monocular depth estimation, which plays a key role in understanding 3D scene geometry, is fundamentally an ill-posed problem. Existing methods based on deep convolutional neural networks (DCNNs) have examined this problem by learning convolutional networks to estimate continuous depth maps from monocular images. However, we find that training a network to predict a high spatial resolution contin...

Qualitative Estimation of Depth in Monocular Vision

In this paper we propose two techniques to qualitatively estimate distance in monocular vision. Two kinds of approaches are described, the former based on texture analysis and the latter on histogram inspection. Although both the methods allow only to determine whether a point within an image is nearer or farther than another with respect to the observer, they can be usefully exploited in all t...

Digging deep into “dirty” drugs – modulation of the methylation machinery

DNA methylation and histone modification are epigenetic mechanisms that result in altered gene expression and cellular phenotype. The exact role of methylation in myelodysplastic syndromes (MDS) and acute myeloid leukemia (AML) remains unclear. However, aberrations (e.g. loss-/gain-of-function or up-/down-regulation) in components of epigenetic transcriptional regulation in general, and of the ...

Fusion of stereo and still monocular depth estimates in a self-supervised learning context

We study how autonomous robots can learn by themselves to improve their depth estimation capability. In particular, we investigate a self-supervised learning setup in which stereo vision depth estimates serve as targets for a convolutional neural network (CNN) that transforms a single still image to a dense depth map. After training, the stereo and mono estimates are fused with a novel fusion m...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2023

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v37i1.25090